Segmentation of Broken Characters of Handwritten Gurmukhi Script

نویسندگان

  • Bharti Mehta
  • Simpel Rani
چکیده

Character Segmentation of Handwritten Documents has been an active area of research and due to its diverse applicable environment; it continues to be a challenging research topic. The desire to edit scanned text document forces the researchers to think about the optical character recognition (OCR). OCR is the process of recognizing a segmented part of the scanned image as a character. OCR process consists of three major sub processes pre processing, segmentation and then recognition. Out of these three, the segmentation process is the most important phase of the overall OCR process. Different problems in the characters segmentation of handwritten text is due to the different writing style of different people because the size and shape is not fixed while we write any text. In this work, we formulate an algorithm to segment the scanned document image as a character. According to proposed algorithm, broken characters in Gurmukhi script, we used the segmentation of these characters that can become easily identify how many characters are in one word. To develop the algorithm to segment the characters from a word we are using combinations of two approaches which are Horizontal Profile Projection and Vertical Profile Projection. And get the accuracy is 93%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Segmentation of Isolated and Touching Characters in Offline Handwritten Gurmukhi Script Recognition

Segmentation of a word into characters is one of the important challenges in optical character recognition. This is even more challenging when we segment characters in an offline handwritten document. Touching characters make this problem more complex. In this paper, we have applied water reservoir based technique for identification and segmentation of touching characters in handwritten Gurmukh...

متن کامل

A Script Independent Technique for Extraction of Characters from Handwritten Word Images

A script independent character segmentation from word images technique has been reported here. Word to character segmentation is an important preprocessing step of optical character recognition process. But in case of handwritten text, presence of touching characters decreases the accuracy of the technique of the segmentation of the characters from the word. In this paper, segmentation of handw...

متن کامل

A Technique for Segmentation of Gurmukhi Text

This paper describes a technique for text segmentation of machine printed Gurmukhi script documents. Research in the field of segmentation of Gurmukhi script faces major problems mainly related to the unique characteristics of the script like connectivity of characters on the headline, two or more characters in a word having intersecting minimum bounding rectangles, multicomponent characters, t...

متن کامل

Statistical Feature Extraction Methods for Isolated Handwritten Gurumukhi Script

In this Paper we have used moment based Statistical methods for Feature extraction like Zernike, Pseudo-Zernike methods. Handwritten Gurmukhi isolated characters are used for feature extraction. Our database consists of 75-115 samples of each of 40 characters of Gurmukhi script collected from different writers. These samples are preprocessed, normalized and scaled to 48*48 sizes. Feature extrac...

متن کامل

Online Handwritten Gurmukhi Character Recognition

Computers are greatly influencing the lives of human beings and their usage is increasing at a tremendous rate. The ease with which we can exchange information between user and computer is of immense importance today because input devices such as keyboard and mouse have limitations vis-à-vis input through natural handwriting. We can use the online handwriting recognition process for a quick and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014